PCA-Based Out-of-Sample Extension for Dimensionality Reduction
نویسندگان
چکیده
Dimensionality reduction methods are very common in the field of high dimensional data analysis, where the classical analysis methods are inadequate. Typically, algorithms for dimensionality reduction are computationally expensive. Therefore, their applications to process data warehouses are impractical. It is visible even more when the data is accumulated non-stop. In this paper, an out-of-sample extension scheme for dimensionality reduction is presented. We propose an algorithm which performs an out-of-sample extension to newly-arrived multidimensional data points. Unlike other extension algorithms, such as the Nyström algorithm, the proposed algorithm uses the intrinsic geometry of the data and the properties of dimensionality reduction map. We prove that the error of the proposed algorithm is bounded. Additionally to the out-of-sample extension, the algorithm provides a residual for any new data point that tells us the abnormality degree of this data point.
منابع مشابه
2D Dimensionality Reduction Methods without Loss
In this paper, several two-dimensional extensions of principal component analysis (PCA) and linear discriminant analysis (LDA) techniques has been applied in a lossless dimensionality reduction framework, for face recognition application. In this framework, the benefits of dimensionality reduction were used to improve the performance of its predictive model, which was a support vector machine (...
متن کاملImproving the RX Anomaly Detection Algorithm for Hyperspectral Images using FFT
Anomaly Detection (AD) has recently become an important application of target detection in hyperspectral images. The Reed-Xialoi (RX) is the most widely used AD algorithm that suffers from “small sample size” problem. The best solution for this problem is to use Dimensionality Reduction (DR) techniques as a pre-processing step for RX detector. Using this method not only improves the detection p...
متن کاملEfficient Retrieval Technique for Microarray Gene Expression
The DNA mciroarray gene data is in the expression levels of thousands of genes for a small amount of samples. From the microarray gene data, the process of extracting the required knowledge remains an open challenge. Acquiring knowledge is the intricacy in such types of gene data, though number of researches is arising in order to acquire information from these gene data. In order to retrieve t...
متن کاملA Simple and Efficient Supervised Method for Spatially Weighted PCA in Face Image Analysis
Principal Component Analysis (PCA) is an example of a successful unsupervised statistical dimensionality reduction method, especially in small sample size problems. Despite the well-known attractive properties of PCA, the traditional approach does not incorporate prior information extracted from a specific domain knowledge. The development of techniques that bring together dimensionality reduct...
متن کاملImpact of linear dimensionality reduction methods on the performance of anomaly detection algorithms in hyperspectral images
Anomaly Detection (AD) has recently become an important application of hyperspectral images analysis. The goal of these algorithms is to find the objects in the image scene which are anomalous in comparison to their surrounding background. One way to improve the performance and runtime of these algorithms is to use Dimensionality Reduction (DR) techniques. This paper evaluates the effect of thr...
متن کامل